US 20020133637A1 


d9) United States 

(12) Patent Application Publication ( m » Pub. No.: US 2002/0133637 Al 
POPP et al. ( 43 ) Pub. Date: Sep. 19, 2002 


(54) METHOD AND APPARATUS FOR 

GENERATING OBJECT-ORIENTED WORLD 
WIDE WEB PAGES 

(76 ) Inventors: NICOLAS POPP, MENLO PARK, CA 
(US); BRUCE ONG, SAN 
FRANCISCO, CA (US) 

Correspondence Address: 

THE HECKER LAW GROUP 
1925 CENTURY PARK EAST 
SUITE 2300 

LOS ANGELES, CA 90067 (US) 

( * ) Notice: This is a publication of a continued pros- 

ecution application (CPA) filed under 37 
CFR 1.53(d). 

(21) Appl. No.: 08/515,057 


(22) Filed: Aug. 14, 1995 

Publication Classification 

(51) Int. Cl. 7 G06F 9/44 

(52) U.S. Cl 709 315 

(57) ABSTRACT 


In the present invention, HTML elements are mapped to 
objects in an object-oriented environment. Classes of objects 
are defined for each HTML element as well as the HTML 
document (or page). By providing a one-to-one mapping 
between each HTML element and object classes, HTML 
documents can be manipulated programmatically. The prop- 
erties of each element are stored in instance variables of the 
associated object. Each object class can include methods to 
manipulate the HTML element within an HTML document. 
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Figure 1 
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Figure 3 
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Figure 5 
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Figure 6A 
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Figure 6B 

















Patent Application Publication Sep. 19, 2002 Sheet 8 of 9 


US 2002/0133637 A1 


702 


r 


Nicolas, 


This is a list of your unread mail: 


Sender: 
Subject: 
Content: 

\ 


/ 

704 


Bruce :^S e : 

Meeting at 10:00 am 
Please contact me if you cannot 
attend a meeting this Thursday 
at 10:00 am. 


Sender: Karen ; |nuge; 

Subject: Meeting at 10:30 am 

Content: There will be a department 

meeting this Thursday at 10:30 am. 


Sender: Jeff 4 ni ?gf; 

Subject: Lunch on Thursday 

Content: How about lunch this Thursday? 


Figure 7 A 
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Figure 7 B 




US 2002/0133637 A1 


1 


Sep. 19, 2002 


METHOD AND APPARATUS FOR GENERATING 
OBJECT-ORIENTED WORLD WIDE WEB PAGES 

BACKGROUND OF THE INVENTION 
[0001] 1. Field of the Invention 

[0002] This invention relates to the generation of object- 
oriented world wide web pages 

[0003] 2. Background 

[0004] The Internet is a worldwide matrix of intercon- 
nected computers. An Internet client accesses a computer on 
the network via an Internet provider. An Internet provider is 
an organization that provides a client (eg., an individual or 
other organization) with access to the Internet (via analog 
telephone line or Integrated Services Digital Network line, 
for example). A client can, for example, download a file 
from or send an electronic mail message to another com- 
puter/client using the Internet. 

[0005] To retrieve a file on the Internet, a client must 
search for the file, make a connection to the computer on 
which the file is stored, and download the file. Each of these 
steps may involve a separate application and access to 
multiple, dissimilar computer systems. The World Wide 
Web (WWW) was developed to provide a simpler, more 
uniform means for accessing information on the Internet. 

[0006] The components of the WWW include browser 
software, network links, and servers. The browser software, 
or browser, is a user-friendly interface (i.e., front-end) that 
simplifies access to the Internet. A browser allows a client to 
communicate a request without having to learn a compli- 
cated command syntax, for example. A browser typically 
provides a graphical user interface (GUI) for displaying 
information and receiving input. Examples of browsers 
currently available include Mosaic, Netscape, and Cello. 

[0007] Information servers maintain the information on 
the WWW and are capable of processing a client request. 
Hypertext Transport Protocol (HTTP) is the standard pro- 
tocol for communication with an information server on the 
WWW. HTTP has communication methods that allow cli- 
ents to request data form a server and send information to the 
server. 

[0008] To submit a request, the client contacts the HTTP 
server and transmits the request to the HTTP server. The 
request contains the communication method requested for 
the transaction (e.g., GET an object from the server or POST 
data to an object on the server). The HTTP server responds 
to the client by sending a status of the request and the 
requested information. The connection is then terminated 
between the client and the HTTP server. 

[0009] A client request therefore, consists of establishing 
a connection between the client and the HTTP server, 
performing the request, and terminating the connection. The 
HTTP server does not retain any information about the 
request after the connection has been terminated. HTTP is, 
therefore, a stateless protocol. That is, a client can make 
several requests of an HTTP server, but each individual 
request is treated independent of any other request. The 
server has no recollection of any previous request. 

[0010] An addressing scheme is employed to identify 
Internet resources (e.g., HTTP server, file or program). This 


addressing scheme is called Uniform Resource Locator 
(URL). A URL contains the protocol to use when accessing 
the server (e.g., HTTP), the Internet domain name of the site 
on which the server is running, the port number of the server, 
and the location of the resource in the file structure of the 
server. 

[0011] The WWW uses a concept known as hypertext. 
Hypertext provides the ability to create links within a 
document to move directly to other information. To activate 
the link, it is only necessary to click on the hypertext link 
(e.g., a word or phrase). The hypertext link can be to 
information stored on a different site than the one that 
supplied the current information. A URL is associated with 
the link to identify the location of the additional information. 
When the link is activated, the client’s browser uses the link 
to access the data at the site specified in the URL. 

[0012] If the client request is for a file, the HTTP server 
locates the file and sends it to the client. An HTTP server 
also has the ability to delegate work to gateway programs. 
The Common Gateway Interface (CGI) specification defines 
the mechanisms by which HTTP servers communicate with 
gateway programs. A gateway program is referenced using 
a URL. The HTTP server activates the program specified in 
the URL and uses CGI mechanisms to pass program data 
sent by the client to the gateway program. Data is passed 
from the server to the gateway program via command-line 
arguments, standard input, or environment variables. The 
gateway program processes the data and returns its response 
to the server using CGI (via standard input, for example). 
The server forwards the data to the client using the HTTP. 

[0013] A browser displays information to a client/user as 
pages or documents. A language is used to define the format 
for a page to be displayed in the WWW. The language is 
called Hypertext Markup Language (HTML). A WWW page 
is transmitted to a client as an HTML document. The 
browser executing at the client parses the document and 
produces a displays a page based on the information in the 
HTML document. 

[0014] HTML is a structural language that is comprised of 
HTML elements that are nested within each other. An 
HTML document is a text file in which certain strings of 
characters, called tags, mark regions of the document and 
assign special meaning to them. These regions are called 
HTML elements. Each element has a name, or tag. An 
element can have attributes that specify properties of the 
element. Blocks or components include unordered list, text 
boxes, check boxes, radio buttons, for example. Each block 
has properties such as name, type, and value. The following 
provides an example of the structure of an HTML document: 


[0015] 

<HTML> 

[0016] 

<HEAD> 

[0017] 

. . . element(s) valid in the document head 

[0018] 

</HEAD> 

[0019] 

<BODY> 

[0020] 

. . . element(s) valid in the document body 

[0021] 

</BODY> 

[0022] 

</HTML> 
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[0023] Each HTML element is delimited by the pair of 
characters “<” and “>” The name of the HTML element is 
contained within the delimiting characters. The combination 
of the name and delimiting characters is referred to as a 
marker, or tag. Each element is identified by its marker. In 
most cases, each element has a start and ending marker. The 
ending marker is identified by the inclusion of an another 
character, “/” that follows the “<” character. 

[0024] HTML is a hierarchical language. With the excep- 
tion of the HTML element, all other elements are contained 
within another element. The HTML element encompasses 
the entire document. It identifies the enclosed text as an 
HTML document. The HEAD element is contained within 
the HTML element and includes information about the 
HTML document. The BODY element is contained within 
the HTML. The BODY element contains all of the text and 
other information to be displayed. Other HTML elements 
are described in an HTML reference manual. 

[0025] The prior art HTML is not object-oriented. An 
HTML element is contained as a string within a flat, ASCII 
file. An application must be written to manipulate an HTML 
file. It would be beneficial to have the ability to map HTML 
elements to classes of objects that define the behavior of 
HTML elements. 

SUMMARY OF THE INVENTION 

[0026] In the present invention, HTML elements are 
mapped to objects in an object-oriented environment. 
Classes of objects are defined for each HTML element as 
well as the HTML document (or page). By providing a 
one-to-one mapping between each HTML element and 
object classes, HTML documents can be manipulated pro- 
grammatically. The properties of each element are stored in 
instance variables of the associated object. Each object class 
can include methods to manipulate the HTML element 
within an HTML document. 

[0027] An HTML document defines a World Wide Web 
page. An HTML document can be generated using HTML 
templates. Multiple HTML templates can be used to gener- 
ate a single HTML document. An HTML template consists 
of HTML element statements. A parser parses an HTML 
template and generates an object tree. 

[0028] The object tree is traversed during HTML docu- 
ment generation. When the HTML document is rendered, or 
generated, the root of the object tree is sent a message to 
create the document. The root object processes this message 
by creating its corresponding HTML element statement(s). 
The "create” message is then forwarded by the parent object 
to its children. This process is repeated until all of the objects 
in the object tree receive the message. Once all of the objects 
process the "create” message, the HTML document is gen- 
erated. 

[0029] An HTML document can also be generated without 
using an HTML template. An object tree can be created 
dynamically during runtime. For example, a root object 
(e.g., a PAGE object) is instantiated at runtime. During 
processing, additional HTML objects can be instantiated at 
inserted into the object tree. For example, a BODY object is 
inserted in the object tree as a child of the root object based 
on logic contained in an application procedure. Attributes of 
the BODY object can be set during application processing. 


Additional HTML objects can be added to the object tree in 
a similar manner until all of the objects have been 
assembled. As previously described, the HTML document is 
generated by sending a “create” message to the objects in the 
object tree. 

[0030] An HTML template can include a server-side 
HTML extension known as the group extension. A group 
extension provides the ability to create a block of HTML 
statements. A name attribute of the group extension provides 
the ability to identify the group. The name attribute can be 
stored in a hash table. During processing, the group identity 
can be retrieved and used to traverse an object tree. 

[0031] The group extension also ensures the scoping of 
named elements A group object maintains a hash table that 
includes the named elements which are its members. The 
group element scopes the named elements within itself. 
Therefore, two elements having the same name in different 
groups are distinguishable. Each one is scoped to its respec- 
tive group. 

[0032] A declarations file is used in combination with the 
group extension. The declarations file contains additional 
definition for a group extension. An entry in the declarations 
file includes a label that links the entry to the group 
extension. The entry also declares an HTML object. That is, 
the class of the HTML object is defined. Properties of the 
HTML object can also be defined within the entry. Values for 
properties provided in the object class definition can be used 
to populate the properties in an instance of the object class 
instantiated for the group. 

[0033] A declaration entry modifies its associated group 
by adding element to the group or modifying the elements 
that already exist in the group. For example, an HTML 
object, or element, declared in the entry inserts itself into the 
group that bears the same name as the declaration entry. 
Property values that are declared in the declaration entry are 
used to modify the HTML object’s properties. 

[0034] An instance of the group extension contained in an 
HTML template is not included in the HTML document that 
is sent to the client browser. Therefore, the client browser 
does not need to recognize the group extension. When the 
HTML document is rendered, the HTML objects contained 
within a group object render themselves to generate the 
HTML for the group. The HTML objects generate actual 
HTML statements within the HTML document. That is, the 
group does not generate HTML statements. However, the 
group transmits a message to its children (e.g., an HTML 
object that inserted itself within the group) to generate 
HTML statements. 

[0035] A declaration entry may contain, for example, a 
declaration for a text string HTML object. The STRING 
HTML object is inserted into the group identified by the 
entry. When the HTML document is rendered, the STRING 
HTML object generates the HTML statements necessary to 
insert the text string into the HTML document. 

[0036] A group extension also provides the ability to 
identify a block of HTML as a repeating subcomponent of 
the HTML document. The block identified by the group 
extension can be repeated multiple times within the HTML 
document. A repeating group can be used to render HTML 
statements that contain data that is retrieved from an external 
data source, for example. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0037] FIG. 1 provides an example of a general purpose 
computer to be used with the present invention. 

[0038] FIG. 2 provides a block-level diagram that illus- 
trates components of the present invention. 

[0039] FIG. 3 provides an example of a flow for process- 
ing a Web page using the object-oriented capabilities of the 
invention. 

[0040] FIG. 4 provides a diagram of the object tree or 
hierarchy for an HTML template provided in Table One. 

[0041] FIG. 5 provides an example of a flow for process- 
ing stored templates for an HTML document. 

[0042] FIG. 6A provides an example of a process flow to 
parse a template and generate an object tree. 

[0043] FIG. 6B modifies the flow provided in FIG. 6A to 
include steps to add tag name to a hash table. 

[0044] FIG. 7A provides an example of a Web page that 
contains a repeating element. 

[0045] FIG. 7B provides an example of the object tree for 
a repeating subcomponent of a Web page. 

DETAILED DESCRIPTION OF THE 
INVENTION 

[0046] A method and apparatus for generating object- 
oriented World Wide Web pages is described. In the follow- 
ing description, numerous specific details are set forth in 
order to provide a more thorough description of the present 
invention. It will be apparent, however, to one skilled in the 
art, that the present invention may be practiced without these 
specific details. In other instances, well-known features have 
not been described in detail so as not to obscure the 
invention. 

[0047] The present invention can be implemented on a 
general purpose computer such as illustrated in FIG. 1. A 
keyboard 110 and mouse 111 are coupled to a bi-directional 
system bus 118. The keyboard and mouse are for introducing 
user input to the computer system and communicating that 
user input to CPU 113. The computer system of FIG. 1 also 
includes a video memory 114, main memory 115 and mass 
storage 112, all coupled to bi-directional system bus 118 
along with keyboard 110, mouse 111 and CPU 113. The 
mass storage 112 may include both fixed and removable 
media, such as magnetic, optical or magnetic optical storage 
systems or any other available mass storage technology. Bus 
118 may contain, for example, 32 address lines for address- 
ing video memory 114 or main memory 115. The system bus 
118 also includes, for example, a 32-bit DATA bus for 
transferring DATA between and among the components, 
such as CPU 113, main memory 115, video memory 114 and 
mass storage 112. Alternatively, multiplex DATA/address 
lines may be used instead of separate DATA and address 
lines. 

[0048] In the preferred embodiment of this invention, the 
CPU 113 is a 32-bit microprocessor manufactured by 
Motorola, such as the 680x0 processor or a microprocessor 
manufactured by Intel, such as the 80x86, or Pentium 
processor. However, any other suitable microprocessor or 
microcomputer may be utilized. Main memory 115 is com- 


prised of dynamic random access memory (DRAM). Video 
memory 114 is a dual-ported video random access memory. 
One port of the video memory 114 is coupled to video 
amplifier 116. The video amplifier 116 is used to drive the 
cathode ray tube (CRT) raster monitor 117. Video amplifier 
116 is well known in the art and may be implemented by any 
suitable means. This circuitry converts pixel DATA stored in 
video memory 114 to a raster signal suitable for use by 
monitor 117. Monitor 117 is a type of monitor suitable for 
displaying graphic images. 

[0049] The computer system described above is for pur- 
poses of example only. The present invention may be 
implemented in any type of computer system or program- 
ming or processing environment. When a general purpose 
computer system such as the one described executes the 
processes and process flows described herein, it is a means 
for generating an HTML page using object-oriented HTML 
elements. 

[0050] In the present invention, HTML elements are 
mapped to objects in an object-oriented environment. 
Classes of objects are defined for each HTML element as 
well as the HTML document (or page). Each HTML element 
can be mapped to a corresponding object class (e.g., Objec- 
tive C, C++, or SmallTalk object class). By providing a 
one-to-one mapping between each HTML element and 
object classes, HTML documents can be manipulated pro- 
grammatically. The properties of each element are stored in 
instance variables of the associated object. Each object class 
can include methods to manipulate the HTML element 
within an HTML document. 

[0051] The object-oriented approach used by the present 
invention provides the ability to dynamically generate a 
page, or HTML document. Some or all of a page can be 
generated based on input received in a receive form, gen- 
erated at runtime, or retrieved from an external data source 
(e.g., database or electronic mail system). The dynamic 
generation of HTML documents is more fully described in 
a co-pending U.S. patent application entitled “Method and 
Apparatus for Processing Internet Transactions”, Ser. 

No. , filed on Aug. 15, 1995 and incorporated herein 

by reference. 

[0052] As previously indicated, the present invention 
objectifies (Objective C, C++ objects, SmallTalk objects, 
etc.) HTML elements. By doing so, the HTML objects can 
be connected to other objects. That is, the HTML objects can 
be connected to objects in the business model (i.e., data 
model objects). Internet transaction processing is more fully 
described in a co-pending U.S. patent application entitled 
“Method and Apparatus for Processing Internet Transactions 

”, Ser. No. . filed on Aug. 15, 1995 and incorporated 

herein by reference. 

[0053] The present invention provides the ability to access 
an application using any interface. For example, a client can 
access the same application via the Internet using a system 
running Windows, MAC/OS, Sun OS, NextStep, etc. Refer- 
ring to FIG. 2, client 202 is resident on a corporate network 
200. Corporate network 200 is a local area network com- 
prised of personal computers such as client 202, for 
example. Client 202 executes a browser such as Netscape or 
Mosaic to interface with World Wide Web (WWW) 204. The 
browser executing on client 202 displays HTML pages 
transmitted to client 202 via WWW 204. When the client 
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browser submits a request contained within an HTML page, 
the page is transmitted via WWW 204 to HTTP server 206. 
One attribute of the submitted page can be a Universal 
Resource Locator (URL). A URL specifies procedures to be 
executed and/or it data input by the user, for example. 

[0054] HTTP server 206 can service the client request by, 
for example, retrieving a return form or file (specified by a 
URL contained in the HTML document) and transmitting it 
to client 202 via WWW 204. Further, HTTP server 206 can 
initiate a program identified in the URL, In the preferred 
embodiment, the URL specifies CGIMessenger 210 to ser- 
vice the client request. CGIMessenger is a Common Gate- 
way Interface (CGI) script. The following provides an 
example of a URL: 


The behavior defined by the objects manipulate the HTML 
elements based on the information available to the objects, 
for example. 

[0059] Preferably, a user interface (UI) is used to define 
the HTML elements of a Web page. However, HTML 
elements can be defined using a text editor. The output of 
either a UI tool or a text editor is an ASCII, or flat, file that 
contains definitions for one or more HTML elements. The 
HTML elements are arranged within the file in the defini- 
tional hierarchy (e.g., BODY element is contained within the 
HTML element). 

[0060] The HTML elements that define a Web page can be 
included in a single ASCII file. A file that contains HTML 
elements is referred to as an HTML template. An HTML 


http.//www next.com/cgi_bin/cgimessenger/app_name/rcv_form/send_form 


[0055] A URL has multiple parts. The first part, “http:” 
specifies the protocol to be used. Other examples of protocol 
include “ftp:”, “gopher:” and “wais”. The next part specifies 
the Internet domain name of the server. In this case, the 
Internet domain name of the HTTP server is "www.next- 
.com”. The next part, “/cgi_bin/cgi_messenger” specifies the 
path and filename of the desired resource. The resource can 
be a file, form, or program, for example. In this case, the 
resource is a common gateway interface program that is 
used to route requests to applications. The application that 
processes the client request is specified in the URL with the 
“ a PP_ name ” specification. The names of the receiving form 
(e.g., the form containing the client request) and the sending 
form (e.g., a reply to the client request) can be included in 
the URL. 

[0056] A client request can result in the invocation of 
application 214 in the server domain 208 for example. The 
client request is received by the HTTP server 206. The 
HTTP server 206 initiates CGIMessenger 210. To service 
the client request, CGIMessenger 210 communicates with 
HTTP server 206 to obtain information regarding the client 
request. For example, CGIMessenger 210 obtains the infor- 
mation that accompanied the user request such as form 
and/or application names and/or user input. The form names 
can include the name of the form that was submitted to 
initiate the client request as well as a return form. 

[0057] The application name identifies the application that 
services the client request. If an application is specified, 
CGIMessenger 210 transmits the client request and corre- 
sponding information transmitted from HTTP Server 204 to 
application 214. Application 214 executes an interaction 
flow to satisfy the user request. 

[0058] A simple hypertext request (e.g., a request for 
another form specified in the request) is serviced by HTTP 
Server 206 by returning the specified form. For requests 
other than simple hypertext requests, the present invention 
can execute logic such as CGIMessenger 210 and applica- 
tion 214, for example, to process a client request. The 
application includes objects such as those created in an 
objective C, C++, or SmallTalk environment to manage the 
components of an HTML document (i.e., HTML elements). 


template can include a complete definition of the Web page. 
However, an HTML template may define a subset of the 
Web page. Two or more HTML templates that define por- 
tions of the Web page can be combined to provide a 
complete definition. The following provides an example of 
the contents of an HTML template: 

TABLE ONE 

<HTML> 

<BODY> 

This is a list of employees: 

<P> 

<OL> 

<LI> James </LI> 

<LI> Mary </LI> 

</OL> 

</BODY> 

</HTML> 


[0061] The above HTML template includes HTML, 
BODY, Paragraph (P), Ordered List (OL),and List Item (LI) 
elements. The present invention defines a class for each 
HTML element. An object class defines instance variables to 
store information associated with the HTML element such as 
attributes, or properties, of an HTML element. Methods can 
be defined to manipulate an HTML element to render (i.e., 
generate) an HTML document, for example. 

[0062] Referring to FIG. 2, application 214 includes 
objects 216. Objects 216 correspond to the HTML elements 
that define a WWW page. Objects 216 are arranged in a tree 
structure that corresponds to the hierarchical structure of the 
HTML elements that they implement. To render an HTML 
document, application 214 sends a message to the root 
object 218 to render itself. In response to this message, root 
object 218 renders the corresponding HTML element(s). 
Root object 218 sends a message to its children, objects 
220A-220B, to render themselves. This process is continued 
until all of the nodes in the object tree have been processed. 

[0063] The objects in an object tree are instantiated based 
on pre-defined object classes. Object classes are defined to 
manipulate and generate HTML elements in an HTML 
document. The name of the object class can mirror the 
names of the corresponding HTML element to provide an 
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easy association between the two. Alternatively, any map- 
ping scheme can be used to map an HTML element to an 
object class. Instance variables can be used to store property 
information defined by properties in the HTML element 
statement. An object class can further contain instance 
variable declarations to store other information associated 
with an HTML element. 

[0064] An object class can have associated method dec- 
larations that can be used to manipulate the HTML docu- 
ment. For example, an object class may have a method that 
varies the display characteristics of the HTML element 
based on the outcome of some Boolean logic. Further, 
instance variables can be used to provide a pointer to scripts 
used with the HTML document. 

[0065] A script is a type of program that consists of a set 
of instructions that is read and interpreted by another appli- 
cation. An instruction is an expression that conforms to the 
rules and syntax expected by a script interpreter. A script 
language (set of script instructions) may include simple 
control statements such as loops and if/then expressions. 

[0066] Since the language does not include complex con- 
structs, the language is easy to master. As with other 
interpretive languages, a developer does not have to perform 
steps to translate a script into machine-readable code. Even 
though a script is interpreted at runtime, it is not considered 
to cause any degradation in the overall response time (the 
actual network response times are the primary concern). 
Therefore, scripts are typically used to provide the logic 
associated with a Web page. 

[0067] Appendix A provides a list of some object classes 
for HTML elements. Additional or modified class definitions 
can also be used with the present invention. A library of 
object classes are preferably predefined. 

[0068] FIG. 3 provides an example of a flow for process- 
ing a Web page using the object-oriented capabilities of the 
invention. At step 302, a Web page developer creates the 
HTML templates to provide a Web page definition. Alter- 
natively, the Web page can be defined dynamically at 
runtime without using an HTML template. In this case, there 
is no need for the developer to perform step 302. The 
developer generates a script and declarations files needed to 
process the Web page at step 304. A declarations file 
contains information that is used to generate the Web page. 
The HTMLtemplate(s), script(s), and declarations file(s) are 
stored for use when the Web page definition is to be sent to 
a client browser. 

[0069] When the Web page definition is requested by a 
client browser, the associated HTML template(s), script(s), 
and declarations are retrieved from storage. At step 306, a 
parser is invoked to parse the statements contained in the 
HTML template(s). HTML elements are identified by the 
parser. An object tree is generated based on the HTML 
elements identified and the hierarchical structure inherent in 
HTML and defined in the HTML template(s). 

[0070] An HTML object is instantiated for each of the 
HTML elements defined in the HTML template(s) at step 
308. At step 310, the objects instantiated in step 308 are used 
to render the Web page definition (i.e., HTML document). 
The HTML document is sent to the client browser at step 
312. The client browser interprets the HTML document and 
displays the Web page. When the client submits the page, the 


page is returned to the HTTP server as an HTML document. 
The HTML document is received at step 314. At step 316, 
the HTML objects are used to perform the processing based 
on the client input. Processing ends at step 318. 

[0071] The object tree generated during parsing identifies 
the objects and hierarchy defined in an HTML template. 
FIG. 4 provides a diagram of the object tree or hierarchy for 
the HTML template provided in Table One. 

[0072] Page object 402 corresponds to the HTML element. 
The BODY element is contained within the HTML element 
hierarchy. This hierarchy is represented in the object tree by 
a parent-child relationship. Body object 404 is a child of its 
parent, page object 402. Similarly, string object 406, para- 
graph object 408, and ordered list object 410 are children of 
their parent, body object 404. String object 406 corresponds 
with the “This is the list of employees:” text string in Table 
One. Paragraph object 408 and ordered list object 410 
correspond with the paragraph and ordered list elements, 
respectively, in the HTML document. Below the ordered list 
element is a pairing of a list item element and string element 
in the HTML document in Table One. 

[0073] Before an HTML document is sent to the client, the 
HTML objects render the document. An HTML document 
consists of HTML statements. HTML statements define 
HTML elements. An HTML document defines the compo- 
nents of a Web page. The HTML document is sent to client 
browser 202 from HTTP Server 206, for example. Browser 
202 identifies the components of the Web page by parsing 
the HTML document and generates the Web page. 

[0074] Before the H TML document is transmitted to client 
browser 202 by HTTP server 206, it is generated using the 
object-oriented capabilities of the present invention. As 
previously indicated, an HTML document is defined using 
HTML templates created using a process such as the one 
illustrated by FIG. 3. FIG. 5 provides an example of a flow 
for processing stored templates for an HTML document. 

[0075] At step 502, a determination is made whether all 
templates needed to define the HTML document have been 
processed. If all of the templates have been processed, 
processing ends at step 504. If all of the HTML templates 
have not been processed, processing continues at step 506 to 
retrieve the next stored template. At step 508, the template 
is parsed and an object tree is generated. Each object 
identified in the object tree corresponds to an HTML ele- 
ment defined in the HTML template. Each object in the tree 
corresponds to an object class. For each element in the 
HTML template, an instance of the corresponding object 
class is instantiated. At step 510, a determination is made 
whether all of the objects in the template’s object tree have 
been processed. If they have, processing continues at step 
502 to determine whether all of the HTML document’s 
templates have been processed. 

[0076] If the template’s object tree has not been processed, 
processing continues at step 512 to instantiate the next 
object in the object tree. At step 514, pointers to a script can 
be set in the instantiated object’s instance variables, for 
example. Element attribute information is stored in the 
object’s instance variables at step 516. Processing continues 
at step 510 to process any remaining portion of the object 
tree. 

[0077] As illustrated in FIGS. 3 and 5, a parsing routine 
is used at HTML document generation time. Any parser can 
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be used with the present invention. Parsers and parsing 
routines can be downloaded from various sites on the 
Internet. The parsing process is performed during document 
generation to retrieve HTML element information and to 
generate an object tree. FIG. 6A provides an example of a 
process flow to parse a template and generate an object tree. 

[0078] At step 602, a determination is made whether the 
template has been parsed. If it has, processing ends at step 
618. If the template has not been completely parsed, pro- 
cessing continues at step 604 to parse the next element 
definition in the template. If, at step 606, a determination is 
made whether a syntax error exists in the element definition, 
processing continues at step 608 to raise an error condition. 
If an error occurs, processing ends at step 618. Alternatively, 
processing can continue to parse the rest of the template. 

[0079] If no syntax error occurred, the parser proceeds to 
identify the type of element defined by the current element 
statement. At step 610, the object class that corresponds to 
the element type is identified. The object is added to the 
object tree at step 612. Processing continues at step 602 to 
determine if additional parsing is necessary. 

[0080] An HTML document can also be generated without 
using an HTML template. An object tree can be created 
dynamically during runtime. PAGE object 402 is instanti- 
ated at runtime, for example. During processing, additional 
HTML objects can be instantiated and inserted into the 
object tree as children, grandchildren, great-grandchildren, 
etc. of PAGE object 402. For example, BODY object 404 is 
instantiated at runtime based on logic contained in an 
application procedure. BODY object 404 is inserted in the 
object tree as a child of PAGE object 402. Attributes of the 
BODY object can be set in the application processing. The 
remaining portion of the object tree in FIG. 4 can be created 
dynamically as well. 

[0081] An object tree that is created dynamically without 
using HTML templates, or one that is created based on one 
or more HTML templates can be modified dynamically at 
runtime. Once the object tree is generated, the HTML 
statements of an HTML document are generated by sending 
a "create” message to the objects in the object tree. 

[0082] There is a standard set of HTML elements that are 
understood by most of the Web browsers. Some Web brows- 
ers offer additional HTML elements or extensions. These 
extensions provide additional functionality. However, 
because extensions are not part of the standard HTML, one 
browser’s HTML extension may not be interpretable by 
another browser. 

[0083] The extensions offered by client browsers are inter- 
preted on the client side. The present invention provides an 
extension to HTML that is used on the HTTP server side. 
The HTML extension is filtered out before a Web page is 
sent to a client browser. It is used to interpret an HTML 
template and to render an HTML document before is trans- 
mitted to the client browser. Because HTML extensions are 
not sent to the client browser, there is no need for a client 
browser to support these extensions. Thus, existing client 
browsers that support the standard HTML can be used to 
display HTML documents using the present invention. 

[0084] The present invention uses an extension to the 
standard HTML known as the group extension The group 
extension provides the ability to combine HTML elements 


or statements in a single block. A name attribute of the group 
extension provides the ability to identify the block, or group, 
by name. The name attribute can be stored in a hash table. 
During processing, the group identity can be retrieved and 
used to traverse an object tree. 

[0085] The group extension also ensures the scoping of 
named elements (i.e., elements that have a name by virtue of 
a name attribute). A group object maintains a hash table that 
includes the named elements which are its members. The 
group element scopes the named elements within itself. 
Therefore, two elements having the same name in different 
groups are distinguishable. Each one is scoped to its respec- 
tive group. For example, a group element named "Nico” 
contained within a parent group named "Employee” is 
distinct from a group named “Nico” within a parent group 
named “Tourist”. 

[0086] A declarations file is used in combination with the 
group extension. The declarations file contains additional 
definition for a group extension. An entry in the declarations 
file includes a label that links the entry to the group 
extension. The entry also declares an HTML object. That is, 
the class of the HTML object is defined. Properties of the 
HTML object can also be defined within the entry. Values for 
properties provided in the object class definition can be used 
to populate the properties in an instance of the object class 
instantiated for the group. 

[0087] A declaration entry modifies its associated group 
by adding elements to the group or modifying the elements 
that already exist in the group. For example, an HTML 
object, or element, declared in the entry modifies the group 
by inserting itself into the group that bears the same name as 
the declaration entry. Property values that are declared in the 
declaration entry are used to modify the HTML object’s 
properties. 

[0088] An NSWTAG marker is used to identify the 
bounds of the group. The following is an example of a 
format for an HTML group extension statement using the 
NSWTAG marker: 

[0089] <NSWTAG NAME="GREETING”></ 

NSWTAG> 

[0090] In this example, the group, or NSWTAG extension, 
does not contain an HTML statement. The NSWTAG exten- 
sion is identified by the <NSWTAG> and </NSWTAG> 
delimiters. The NAME portion of the extension specifies the 
name of the group. This name can be used to link to a 
declaration in a declaration file. An example of an entry 
contained in an external definition for “GREETING” is as 
follows: 

[0091] GREETIN G : NSWS tring {value=“Welcome to the 
Cyber Surfshop”} 

[0092] The group extension is associated with the decla- 
ration file entry using the name attribute in the NSWTAG 
extension statement. The same name is provided as a label 
in the declaration file. The label field of an entry is the first 
field in the entry and is located to the left of the colon (e.g., 
“GREETING). To the right of the label (after the colon) in 
the declaration file entry is an indication of an object class. 
The object class field specifies the class of HTML object that 
is to be instantiated and inserted in the “GREETING” group. 
The “NSWString” specification in the “GREETING” entry 
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results in the creation of an instance of the “NSWString” 
HTML object class. The NSWString class provides object 
cover for free-floating text. The “NSWString” HTML object 
instance inserts itself into the “GREETING” group. 

[0093] The declaration contains a property field (i.e., 
delimited by brackets). The property field, value, is used to 
initialize a property in the “NSWString” HTML object. The 
value field contains the text that is to be inserted into the 
HTML document. When the “GREETING” group portion of 
the object is rendered, a “create” message is sent to the 
“GREETING” group object. It does not generate any HTML 
statements in the HTML document. However, it sends a 
“create” message to its children. As its child, the “NSW- 
String” HTML object receives the “create” message. The 
"NSWString” HTML object inserts the text contained in the 
value property into the HTML document. 

[0094] The group extension can be used to insert a block 
of HTML statements into an HTML document. The block of 
HTML statements can occur once in the HTML template. 
The block can also be repeated within the HTML document. 
FIG. 7A provides an example of a Web page that contains 
block that is repeated (i.e., a repeating subcomponent). 

[0095] Web page 702 provides a list of a client’s unread 
electronic mail. As displayed in Web page 702, a mail item 
704 contains information about the sender (name and 
image), subject, and content of the mail message. Mail item 
704 is repeated for each piece of electronic mail. A hori- 
zontal rule separates each mail item. The sender, subject, 
content and horizontal rule elements constitute the repeating 
subcomponent of Web page 702. 

[0096] A group element acts outer block of a group of 
HTML elements that generate the HTML statement block 
(either repeating or nonrepeating). In an object tree, the 
group object acts as the root for the HTML objects that 
correspond to the HTML elements. FIG. 7B provides an 
example of the object tree that is symbolic of the repeating 
subcomponent of Web page 702 of FIG. 7A. The group 
object 712 is the parent of the objects 714 contained within 
the group. Each of the objects 714 contained within the 
group are children of group object 712. The children of 
group object 712 (i.e., objects 714) can be repeated to handle 
the number of mail items to be displayed to the client. 

[0097] In addition to defining the root of a repeating 
component, the group extension can be used to define the 
root of a subtree. The subtree can be combined with other 
trees or subtrees to generate a complete object. The com- 
bined object tree can be used to render a Web page. Thus, for 
example, Web page 702 may be defined in two separate 
HTML templates. One template defines the repeating por- 
tion and the other defines the non-repeating portion. FIG. 
7B provides the object tree for the HTML template that 
defines the repeating portion of Web page 702. The group 
extension can be defined in both templates such that the two 
templates can be combined to generate a single HTML 
document. 

[0098] Group extension scan be nested in the same way 
that HTML elements are nested within each other. An outer 
group provides scoping for an inner group. Therefore, a 
parent group object provides scoping for its children. Table 
Two provides an example of HTML that includes an 
NSWTAG extension. 


TABLE TWO 


<HTML> 

<NSWTAG NAME="SELECT_FORM"> 

<FORM> 

<NSWTAG NAME=”SAILBOARDS"> 

<NSWTAG NAME=“CHECK”xINPUT TYPE= 
“checkbox”x/NSWTAG> 

<NSWTAG NAME=“NAME”x/NSWTAG> 
</NSWTAG> 

</FORM> 

</NSWTAG 

</HTML> 


[0099] The first NSWTAG extension provides an identifier 
for the block containing the form element. The NSWTAG 
statement includes a name attribute, "SELECT_FORM”, to 
identify this block, or group. Within the “SELECT_FORM” 
block, a “SAILBOARDS” NSWTAG extension provides an 
identifier for a repeating subcomponent of the HTML. There 
are two NSWTAG extensions within the repeating subcom- 
ponent (i.e., “CHECK” and “NAME”). An input HTML 
element is nested within each of these NSWTAG extensions. 

[0100] The declarations file that is associated with the 
HTML in Table Two contains further definition for the 
NSWTAG extensions. For example, the outer NSWTAG 
extension (i.e., "SELECT_FORM”) has the following asso- 
ciated entry in the declarations file: 

[0101] SELECT_FORM:NSWForm {action=onSelec- 
tion()}; 

[0102] The label portion of the declaration file entry (i.e., 
“SELECT JFORM”) links the entry with the NSWTAG 
extension in the HTML template having the same name. The 
next field in the entry (i.e., located to the right of the colon) 
indicates that an instance of the NSWForm object class 
should be instantiated and inserted within the SELECT_ 
FORM group object. Following the element/object specifi- 
cation, the entry includes a value for a property or attribute 
of the NSWFORM object. That is, the action attribute is 
assigned the value of “onSelection()”. The “onSelectionQ” 
value indicates the procedure to invoke in the script asso- 
ciated with the HTML document when an action is taken by 
the client, for example. 

[0103] The NSWTAG extension named “SAILBOARDS” 
is contained within the SELECT_FORM group. It also 
begins a repeating subcomponent. An example of the dec- 
larations file entry associated with the “SAILBOARDS” 
NSWTAG extension is as follows: 

[0104] SELECT_FORM SAILBOARDS NSWRepetition 
{ list =sailboards; item=sailboard, 

[0105] The label in the declarations file entry specifies that 
the entry is associated with the “SAILBOARDS” group that 
is within the SELECT_FORM group. The entry further 
indicates that the “SAILBOARDS” group is a repetition 
group. The “SAILBOARDS” group can cause a block of 
HTML statements to be repeated in the HTML document. 
The NSWRepetition specification in the entry indicates that 
an repetition object should be instantiated to process the 
repeating subcomponent. The repetition object can contain 
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HTML objects that generate a multiple occurrences of an 
HTML statement block. The number of times, N, that the 
block is repeated is based on data that is to be loaded into the 
HTML (e.g., list of sailboards). 

[0106] The list attribute specified in the “SELECT_ 
FORM.SAILBOARDS” declarations file entry points to a 
container object (i.e., object that holds other objects), “sail- 
boards”. The item attribute points to the information to be 
retrieved to populate the repeating subcomponent for the 
“SAILBOARDS” repetition group. 

[0107] The HTML template and the declarations file con- 
tains entries that further define the “SELECT_FORM.SAIL- 
BOARDS” repetition group. Two group extensions are 
contained within the “SELECT_FORM. SAILBOARDS” 
nested group structure in the HTML template. These groups, 
“CHECK” and “NAME”, have the following declarations, 
respectively: 

[0108] SELECT_FORM. SAILBOARDS. CHECK:N- 
SWInput {value=sailboard, return=selectionArray}; 

[0109] SELECT_FORM.SAILBOARDS.NAME NSW- 
String {value=sailboard}, 

[0110] An object of class “NSWInput” is instantiated 
based on the specification in the “CHECK” NSWTAG 
statement and the “SELECT_FORM.SAILBOARD- 
S. CHECK” declaration file entry. It inserts itself within the 
"SELECT_FORM.SAILBOARDS. CHECK” nested group. 

[0111] Preferably, property values of the newly instanti- 
ated HTML object are retrieved from its corresponding 
declaration. However, property information can be defined 
in the HTML template. For example, the ”SELECT_FORM. 
SAILBOARDS. CHECK” group HTML element in the 
HTML template specifies the type property (i.e., "text) for 
the NSWInput HTML object. This property could have been 
specified in the corresponding declaration. 

[0112] The "SELECT_FORM.SAILBOARDS.CHECK” 
declaration contains property information. The “value” 
attribute points to a sailboard object that contains the name 
of the first/next sailboard to populate the repeating subcom- 
ponent in the HTML document. The return attribute speci- 
fies the variable, "selectionArray” into which an input value 
is pushed when the HTML document returns from the client. 
The HTML object that is instantiated from the “SELECT_ 
FORM.SAILBOARDS. CHECK” declaration generates an 
INPUT HTML statement of type “checkbox” when it is 
receives the "create” message. 

[0113] The “SELECT_FORM.SAILBOARDS.NAME” 
declaration file entry identifies the NSWString object class. 
An NSWString object is instantiated and inserted into the 
SELECT_FORM.SAILBOARDS” nested group. The 
"NSWString” object generates a string that is equal to the 
value of the first/next sailboard. 

[0114] The NSWTAG extension can be replaced using 
data stored in properties of the object(s) associated with the 
HTML element and NSWTAG extension. Table Three pro- 
vides an example of HTML including NSWTAG extensions 
contained within an ordered list HTML element. The 
NSWTAG extensions provide associations to entries in the 
declarations file that point to data stored in object properties. 


TABLE THREE 


<HTML> 

<NSWTAG NAME=''SELECT_FORM"> 

<FORM> 

<OL> 

<NSWTAG NAME="WEBPEOPLE"> 

<NSWTAG NAME=“FN”xINPUT TYPE="text"> 
</NSWTAG> 

<NSWTAG NAME=“LN'’xINPUT TYPE="text"> 
</NSWTAG> 

<P> 

</NSWTAG> 

</OL> 

</FORM> 

</NSWTAG 

</HTML> 


[0115] A "WEBPEOPLE” NSWTAG extension is con- 
tained within the “SELECT_FORM” NSWTAG extension. 
The “WEBPEOPLE” NSWTAG extension includes two 
nested group HTML elements. Associated with each HTML 
group elements is a corresponding declaration file entry. The 
following provides an example of declaration file entries for 
the “WEBPEOPLE”, "FN”, and "LN” NSWTAG exten- 
sions, respectively: 

[0116] SELECT_FORM.WEBPEOPLE:NSWRepetition 
{list=web_dept; item=employee}; 

[0117] SELECT_FORM.WEBPEOPLE.FN:NSWInput 
{value=employee fn}, 

[0118] SELECT_FORM.WEBPEOPLE.LN:NSWInput 
{value=employee.ln}; 

[0119] The “WEBPEOPLE” declaration file entry indi- 
cates that the "WEBPEOPLE” NSWTAG extension is a 
repeating subcomponent or group of HTML. Group objects 
are instantiated for each. When the HTML document is 
rendered, the “WEBPEOPLE” group object sends a “create” 
message to the “FN” and “LN” group objects. The “FN” and 
“LN” group objects send a “create” message to the two 
“NSWInput” objects instantiated as a result of the 
“SELECT_FORM. WEBPEOPLE. FN” and “SELECT_ 
FORM.WEBPEOPLE.LN” declarations. The declarations 
specify the data associated with the “fn” and “In” properties 
of an employee object. The “NSWInput” objects generate 
HTML statements in the HTML document using the values 
of the “fn” and “In” properties of the employee object. 

[0120] The “FN” and “LN” groups are contained within 
the “WEBPEOPLE” group. The “WEBPEOPLE” group is a 
repetition group. Therefore, “FN” and “LN”groups can be 
activated multiple times to generate HTML statement blocks 
within the HTML document. 

[0121] The name property of the group, or NSWTAG, 
extension provides the ability to identify the group and the 
statements contained within the group. During parsing, a 
name property associated with a group can be stored in a 
hash table. During processing, an object can be located by 
traversing the object tree. Preferably, however, one object 
can retrieve group names from the hash table to locate 
another object within the object tree. FIG. 6B modifies the 
flow provided in FIG. 6A to include steps to add tag name 
information to a hash table. 
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[0122] Steps 614 and 616 are added to the flow. After the 
object is added to the object tree at step 612, a determination 
is made whether the parsed statement contained a name 
property. If there is no name property, processing continues 
at step 602 to process any remaining elements of the HTML 
template. If a name property is defined, processing continues 
at step 616 to store the identifier in the hash table. Processing 
continues at step 602 to process any remaining elements of 
the HTML template. 

[0123] Thus, a method and apparatus for generating 
object-oriented world wide web pages is provided. 

APPENDIX A 


Class 


Description 


NSWHAnchor 

NSWHCheckbox 

NSWHComment 

NSWHElement 

NSWHForm 

NSWHGroup 

NSWHImage 

NSWHImagelnput 

NS WHImagemap Region 
NSWHInput 

NSWHKeyValueCoding 


NSWHList 


NSWHListltem 

NSWHMarkedElement 

NSWHMenu 

NSWHOption 

NSWHOrderedList 

NSWHPage 


Object cover for Anchor HTML element 
Object cover for Input HTML element, type is 
“checkbox” 

Object cover for comments 

Object cover for Form HTML element 
Cover for group extension. A group is a 
container for other objects. 

Object cover for Image HTML element 
Object cover for Input HTML element, type is 
“image” 

Abstract class for input HTML elements. All 
input HTML elements subclass this class 
Defines methods to maintain a dictionary of 
data values associated with keys. Values can be 
extracted from the dictionary to initialize an 
object, or retrieved from an object and stored 
in the dictionary. 

A generic container class for HTML elements 
that can contain other elements or plain text. 
This class can be used for HTML elements that 
do not covered by another object class 
Object cover for List Item HTML element 

Object cover for Menu HTML element 
Object cover for Option HTML element 
Object cover for Ordered List HTML element 


APPENDIX A-continued 


Class 

Description 

NSWHRadio 

Object cover for 


“radio” 

NSWHRectRegion 


NSWHRepetitionGroup 

Object cover for 


extension 

NSWHReset 

Object cover for 


“reset” 

NSWHSelect 

Object cover for 

NSWHString 

Object cover for 

NSWHSubmit 

Object cover for 


“submit” 

NSWHText 

Object cover for 


“text” 

NSWHTextArea 

Object cover for 

NSWHParser 


NSWHUnknownElement 

Object cover for 


element 

NSWHUnorderedList 

Object cover for 

NSWHUrl 

Object cover for 

NSWHUrn 

Object cover for 


Input HTML element, type is 


repeating group HTML 

Input HTML element, type is 

Select HTML element 
free-floating HTML text 
Input HTML element, type is 

Input HTML element, type is 

TextArea HTML element 

Unordered List HTML 

Input HTML element 
Uniform Resource Locator 
URN 


1. A method of processing Hypertext Markup Language 
(HTML) elements using objects, comprising the steps of: 

defining object classes for each of said elements of said 
HTML; ' 

creating an HTML template, said HTML template having 
a statement that defines one of said HTML elements; 

identifying the object class associated with said one of 
said HTML elements; 

creating an instance of said object class, said instance 
containing data and methods for processing said one of 
said HTML elements; and 

transmitting a message to said instance to perform an 
operation using the data and methods of said instance. 
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